Vocalization related cortical fields following a simple vowel vocalization 
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1 Introduction 

Since McAdam and Whitaker [1] reported the 
Bereitschaftpotential (BP) prior to the speech onset, 
numerous studies of speech mechanisms have been 
carried out in both the neurological and 
psychophysiological fields. Vocalization-related 
cortical potentials (VRCP) are considered to involve 
sensory, motor and auditory components, and these 
components are affected by the cognitive processes 
for speech [2, 3], Although several components of 
VRCP have been reported [2, 3, 4], the nature of 
their generators is not fully understood. Positron 
emission tomography (PET) [5, 6, 7] and functional 
magnetic resonance imaging (fMRI) [8] have 
demonstrated the probable brain regions engaged in 
the language production processes, but they revealed 
little about the temporal relationship of activities 
among the regions. 

Magnetoencephalography (MEG) has several 
theoretical advantages in identifying the localization 
of cortical dipoles with high spatial resolution as 
compared with electroencephalography (EEG) [9, 
10], We have surmised that vocalization related 
cortical magnetic fields (VRCF) is comprised of 
multiple generator sources, similarly to the sources 
for movement related cortical magnetic fields 
(MRCF) observed following simple finger 
movement [9, 11], Therefore, we used a brain 
electrical source analysis (BESA) for the 
spatiotemporal multiple dipole analysis of the VRCF. 
BESA attempts to separate and to image the source 
current waveforms in addition to localizing multiple 
equivalent current dipoles (ECDs). 

The objective of the present study was to evaluate 
the generator sources of VRCF just before and after 
vocalization onset. Since we focused on the sensori¬ 
motor processes just before and during a simple 
vowel vocalization, we analyzed the cortical activity 
in the period from 150 ms before to 150 ms after 
vocalization onset, and obtained a good model of 
VRCF with six ECDs. We reported the result [12], 
and summarized them in this article. 


2 Methods 

2.1 Subjects 

The subjects were eight normal right-handed 
volunteers (5 males and 3 females; mean age 32 
years, range 24-37 years). 

2.2 Experimental protocol 

For the experiment, the subject rested comfortably 

on his/her side on a bed in a magnetically shielded 
room. The subject was asked to vocalize a vowel 
(/u/) repeatedly, at a short self-paced random 
interval of about 5.0 s, with the jaw relaxed and 
slightly open and the tongue on the floor of the 
mouth. The subject practiced making the sound 
before the recording. 

The trigger for averaging the epoch was generated 
from the vocalization sound wave. In order to obtain 
an appropriate trigger, we used two microphones to 
pick up the subject’s vocalization. One microphone 
(Mic.l) was placed close to the subject’s mouth, and 
the other (Mic.2) was placed 30 cm above the mouth. 
The vocalization sound wave was amplified with a 
time constant of 0.3 s and a high-frequency cutoff 
less than 1 kHz. Mic. 1 obtained a sharp onset of the 
sound waveform, but it also detected the noise from 
breathing or other sounds. Mic.2 recorded only the 
vocalization sound, but the onset of the waveform 
was relatively dull. Thus, we compared the sound 
waveforms recorded by Mic.l and Mic.2, and 
decided the trigger point at the onset of waveform 
from Mic.l. The vocalization sound was 
simultaneously picked-up. One recording block 
consisted of 30 trials of the vocalization, and five 
blocks were tested for each subject with a few 
minutes’ rest between blocks. We also recorded the 
subject’s electro-oculogram (EOG) of the left eye. 
The electrodes for EOG were placed 1.5 cm above 
the superior orbicular edge and 2.0 cm outside of the 
lateral angle of the eye with a time constant of 3.0 s 
and a high frequency cutoff less than 100 Hz. 

2.3 MEG recording 

The VRCF was measured with a dual 37-channel 
biomagnetometer (Magnes, Biomagnetic 
Technologies Inc., San Diego, CA). The detection 



coils of the biomagnetometer were arranged in a 
uniformly distributed array in concentric circles over 
a spherically concave surface. Thus, all of the sensor 
coils were equally sensitive to weak magnetic 
signals from the brain. Each device was 144 mm in 
diameter and its radius of curvature was 122 mm. 
The outer coils in each device were 72.5 degree 
apart. Each coil was connected to a superconducting 
quantum interference device (SQUID). 

The biomagnetometers were centered at around C3 
and C4 of the International 10-20 system in each 
subject. These sites were measured at the same time 
using two probes. The magnetic fields were 
recorded with a relatively narrow band-pass filter, 
0.1-100 Hz. The recordings were digitized during an 
analysis time from -1700 ms to +300 ms at a 
sampling rate of 1024 Hz. The baseline was 
corrected (DC offsets) for each channel according to 
the mean value of the signal between -1700 ms and 
-1400 ms. The epochs with artifacts of eye 
movement and so on were rejected before averaging. 
Epochs with eye movement and other artifacts were 
rejected before averaging. 

2.4 Brain electric source analysis (BESA) 

BESA was modified for our 2x37 channel 
magnetometer placed in bilateral hemispheres. This 
method allows the spatiotemporal modeling of 
multiple simultaneous sources over defined intervals. 
A strategy for localizing the generators of the VRCF 
was applied independently to the waveforms from 
each subject. At each step, the localization and 
orientation of dipoles were calculated by an iterative 
least-square fit. The goodness of fit (GOF) was 
expressed as a percentage of the variance, and signal 
epochs for source analysis were defined on the basis 
of the global field power (GFP) [13]. We considered 
that the adaptation of the dipoles to be acceptable 
when the RV (100-GOF (%)) was less than 10 %. 

We selected the period for analyzing the 
spatiotemporal dipole modeling. We used the 
interval of -150 ms to +150 ms, because this interval 
covered the activities from the preparation to move 
the vocal cords to feedback regarding sufficient 
vocalizing. In addition, we focused on activities just 
before and after the vocalization onset, rather than 
long-standing and complicated magnetic fields 
corresponding to BP. 

3 Results 

All of the subjects performed the vocalization tasks 
successfully with minimal face movement and 
blinking. 


3.1 VRCF evoked by vocalization 

The artifact-free epochs were 130+20 out of 150 
trials in total for one subject. The VRCF was clearly 
identified from approximately -150 ms, and a peak 
of reversed-phase deflection was recognized at 
about +100 ms (Figure 1). These treads of 
waveforms were similarly recorded in the bilateral 
hemispheres in all subjects. 


Subject 1 
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Figure 1: The VRCF waveforms before and after the 
vocalization at the left and right hemispheres 
(around C3 and C4) in subject 1. The waveforms 
recorded from 3 7 channels were superimposed. 

3.2 Spatiotemporal source analysis 

We were able to obtain low %RV with six ECDs 
during the period from -150 to +150 ms. The 
location and orientation of the six sources in 3 
representative subjects are shown in the right part of 
Figure 2. The dipole direction shown in the spherical 
head models means the positive direction of the 
waveforms. Sources 1 and 2 were located in the area 
inferior to the hand motor area of the left and right 
hemisphere, respectively. Sources 3 and 4 were 
located in the auditory area, the upper part of the 
temporal lobe, of the left and right hemispheres, 
respectively. The locations of sources 5 and 6 were 
in the motor area around truncal motor areas of the 
left and right hemispheres. The RV of this six-dipole 
model was low enough, i.e., in seven subjects 
(mean±SD 9.0+1.76), but one subject shows 
relatively high RV (15.4 %). These six sources 
were obtained in the same areas in all subjects. We 
tried to find additional sources, but no source 
showed an appropriate waveform or improved the 
RV. 






Figure 2: The results of six-sources analysis of 
subject 1. Left: temporal activity of each source 
obtained BESA. Right: the localization and the 
orientation of each dipole on the spherical head 
model. The line from each point indicates the 
direction of the dipole current. Sources 1 and 2 
were located in the laryngeal motor areas, and 
sources 3 and 4 were in the auditory areas. 
Sources 5 and 6 were located in the truncal motor 
areas. The sources in the motor areas (sources 1, 
2, 5, 6) were activated approximately prior to the 
vocalization onset, while the activity of the auditory 
sources (sources 3 and 4) appeared after the 
vocalization onset. All six sources were temporally 
overlapped after the onset of vocalization. 


The time courses of the activities of sources 3 and 4 
were clearly different from those of the other 
sources in the left part of Figure 2. The major peak 
of source 3 and source 4 appeared after the 
vocalization onset, while the other sources’ onset 
was 150-200 ms before vocalization. Flowever, the 
mean peak latency of all 6 sources overlapped 
approximately 50-100 ms after the vocalization 
onset, indicating that activities in the motor cortices 
continued after the vocalization onset. 


4 Discussion 

We confirmed that the VRCF involves (1) the 
activities in the motor cortices which were probably 
responsible for the initiation and preparation for 


vocalization, and (2) the activities in the auditory 
cortices related to self-vocalization. The ECDs 
estimated in the source model were in the laryngeal 
and truncal areas of the ‘homunculus’ of the sensori¬ 
motor cortex [14], and in the auditory cortex. 

The reason for the restricted location of sources 1 
and 2 given by the BESA is based on the evidence 
for bilateral laryngeal activation of the motor cortex 
during vocalizations studied by PET [5, 6] and fMRI 
[8], The peak latencies of these components were 
extended slightly after the vocalization onset. In 
studies of the MRCP [15] and MRCF [9], the peak 
latency of motor activity was just after the 
movement onset. Thus, we considered that sources 
1 and 2 mainly involved the motor activity of 
laryngeal muscle and/or vocal cords. 

Small components were observed in the auditory 
cortex, on the inferior bank of the Sylvian fissure 
(sources 3 and 4). The responses of those sources 
started just after the vocalizations onset, while the 
activation of the sources in motor cortices appeared 
from about 250-150 ms before the vocalization. 
The peak latencies of sources 3 and 4 were both 
about 100 ms after the vocalization onset. The 
previous study [16] reported that the peak latency of 
the brain potential after hearing human voice was 
approximately 100 ms (Nl). Since the locations of 
sources 3 and 4 were similar to the auditory area 
estimated in other study of speech sounds [17]. 
Therefore, we considered that the activities of 
sources 3 and 4 corresponded to the activity related 
to the response of subjects central auditory system to 
his/her own voice, probably including N1 activity. 
Each activity in motor area had also a peak after the 
vocalization onset, but we considered that it was 
sustained motor activity in each area. The temporal 
changes in the activities in sources 5 and 6 appeared 
to be very similar to those of sources 1 and 2, and 
their location seemed to be in the truncal part in the 
homunculus of the motor cortex in the bilateral 
hemispheres. This finding indicated that the truncal 
muscles including the diaphragm contribute to 
produce the voice. 

As for the lateralization of activities, sensori-motor 
function related to the face is known to be bilateral, 
and bilateral sides of truncal muscles should be 
activated during vocalization. Therefore, the 
symmetrical activity during vocalization found in 
the present study seems compatible with current 
anatomical and physiological knowledge. 

Some investigators have reported the activation of 
Broca’s area during syllable or word vocalization [3, 
17], However, since we used a simple task of vowel 
vocalization rather than a meaningful syllable or 










word, the cortical activities were symmetrically 
identified without any significant activities of 
Broca’s area. 

In conclusion, we analyzed the VRCF recorded just 
before and after vocalization onset, using MEG and 
a BESA. We successfully identified the sources in 
the laryngeal and truncal motor area which were 
activated prior to the vocalization onset, and other 
sources in the auditory cortex which were activated 
just after the vocalization onset. All sources were 
found symmetrically in the bilateral hemispheres. 
We speculate that these activities are responsible for 
the descending motor volley and the central auditory 
processes during vocalization. 
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